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Randomized Gossip Algorithm with 
Unrehable Communication* 

Guodong Shi, Mikael Johansson and Karl Henrik Johansson^ 

Abstract 

In this paper, we study an asynchronous randomized gossip algorithm under unrehable 

communication. At each instance, two nodes are selected to meet with a given probability. 

When nodes meet, two unreliable communication links are established with communication 

O^ I in each direction succeeding with a time- varying probability. It is shown that two particularly 

interesting cases arise when these communication processes are either perfectly dependent 

^ ' or independent. Necessary and sufficient conditions on the success probability sequence are 

00 ■ 
0\J ' proposed to ensure almost sure consensus or e-consensus. Weak connectivity is required 

o : 

\^ , when the communication is perfectly dependent, while double connectivity is required when 

CO . the communication is independent. Moreover, it is proven that with odd number of nodes, 

O ■ 

^S| ' average preserving turns from almost forever (with probability one for all initial conditions) 

for perfectly dependent communication, to almost never (with probability zero for almost 

. all initial conditions) for the independent case. This average preserving property does not 

^ , hold true for general number of nodes. These results indicate the fundamental role the node 



C^ 



interactions have in randomized gossip algorithms. 
Keywords: Gossip algorithms, Unreliable communication, Consensus, Threshold 

1 Introduction 

Peer-to-peer networks, sensor networks and social networks constitute a new generation of engi- 
neering systems that has grown tremendously in importance over the last decade. The absence 
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of a central decision-making entity, the unprecedented number of interacting nodes, the time- 
varying topology of node interactions, and the unreliability of nodes are key challenges for the 
analysis and design of these systems. Gossiping algorithms, in which each node exchanges data 
and decisions with at most one neighboring node in each time slot, have proven to be a ro- 
bust and efficient way to structure distributed computation and information processing over 
such networks |10l [9l \TT\ [T4] . A limitation of the current literature is that while it allows node 
interactions to be random, it assumes that the probabilities that two specific nodes interact 
is constant in time, and that when two nodes interact, both nodes completely and correctly 
execute the proposed algorithm. However, unreliable communication in wireless networks, and 
asymmetry of trust in social networks challenge such assumptions. This paper develops a frame- 
work for analysis of gossip algorithms that separates the random process for node interactions 
with the random process for successful information exchange and algorithm execution, and al- 
lows time-varying success probabilities for these operations. Necessary and sufficient conditions 
for the underlying graph structure and probability sequences are developed to ensure a.s. or 
e-consensus under both perfectly dependent communication, in which the bidirectional message 
exchange between nodes either succeeds of fails, and independent communication, in which the 
success of communication in each direction is independent of the outcome in the other direction. 
Gossip algorithms for distributed averaging arise in many applications, such as load balancing 
in parallel computing |181 [T9] , coordination of autonomous agents [22j , distributed estimation 
[40j and analysis of opinion dynamics [43j . A central problem here is to analyze if a given gossip 
algorithm converges to consensus, and to determine the rate of convergence of the consensus 
process: Karp et al. [9] derived a general lower bound for synchronous gossiping; Kempe et 
al. [lOj proposed a randomized gossiping algorithm on complete graphs and determined the 
order of its convergence rate; and Boyd et al. [12j established both lower and upper bounds 
for the convergence time of synchronous and asynchronous randomized gossiping algorithms, 
and developed algorithms for optimizing parameters to obtain fast consensus. In [43j, a gossip 
algorithm was used to describe the spread of misinformation in social networks, where the state 
of each node was viewed as its belief and the randomized gossip algorithm characterized the 
dynamics of the belief evolution. A detailed introduction to gossip algorithms can be found in 



More generally, consensus problems on graphs have been investigated by researchers from 
many disciplines, including computer science [181 US]) engineering [271 ESI EOl 1121 [ZD [2H] and 



social sciences \n\ 144]. Deterministic consensus algorithms have been studied extensively both 
for time-invariant and time-varying communication graphs. Efforts have typically been devoted 
to finding connectivity conditions which ensure convergence to consensus for the network model 
under consideration |2T1 [22l HH [291 [201 [281 [251 [26] . Randomized consensus algorithms, motivated 
by the stochastic information flow over networks, have also been considered [301 ED EH [351 
\32\ I40j . Many sufficient and/or necessary conditions have been established that guarantee a 
global consensus with probability one [301 EH [Ml [351 E2l [39] under different settings for the 
randomization of the communication graph and convergence rates have also, in some cases, 
been established [34l [35l [36] . 

Most of the existing work on randomized consensus algorithms have focused on the influence 
of randomness in the underlying communication graph on the agreement seeking process. In [121 
[33ll43j. the communication graph was determined by a stochastic matrix whose elements specify 
the probability that each pair of nodes is selected to execute the gossip algorithm with each other 
in a given time instant. In [37], the communication graph was described by a finite-state Markov 
chain where each state corresponds to a specific realization of the communication graph, and 
conditions for guaranteeing almost sure consensus were given. In [30], the authors studied 
linear consensus dynamics with communications graphs given by a sequence of independent 
and identically distributed (i.i.d.) Erdos-Renyi random graphs; the analysis was later extended 
to directed Erdos-Renyi graphs in [31]. Mean-square performance for consensus algorithms 
over i.i.d. random graphs was studied in [33j and the influence of random packet drops was 
investigated in [341 I36j . Distributed consensus over random graphs with graph independence 
was also studied in |32[ I40j . In all these works, the randomness in the underlying communication 
graph describes the randomness in the process of initiating interactions. Even if two nodes meet 
with each other, the information exchange could still fail, or be only partially executed (e.g., only 
by one of the two nodes) due to unreliable communication between nodes and the asymmetry 
of trust. 

This paper studies an asynchronous randomized gossip algorithm under unreliable commu- 
nication. A key feature of the proposed analysis framework is that it separates the random 
process for initiating interactions between node pairs from the random process for successful 
communication between two interacting nodes, and that it considers time-varying success prob- 
abilities for inter-node communication. Both perfectly dependent communication, in which the 
bi-directional message exchange between two nodes either succeeds or fails, and independent 



communication, in which the success of communication in each direction is independent of the 
outcome in the other direction, are considered. The perfectly dependent and independent node 
communication reflects directly the symmetry of the gossiping algorithms |33] . We derive neces- 
sary and sufficient conditions on the underlying graph structure and the communication success 
probability sequence to ensure a.s. consensus or e-consensus under perfectly dependent and inde- 
pendent communication, respectively. We investigate the impact of communication dependence 
on the behavior of the algorithm, and show how the underlying requirement for reaching a.s. 
consensus goes from weak connectivity in the perfectly dependent case to double connectivity 
in the independent case, while the probability of preserving the average decreases from one for 
perfectly dependent communication to zero for the independent case when the number of nodes 
is odd. 

The rest of the paper is organized as follows. In Section 2, some preliminary concepts 
are introduced. We present the network model, the randomized gossip algorithm, and the 
standing assumptions in Section 3. Then main results for perfectly dependent and independent 
communication are given in Sections 4 and 5, respectively. Finally, concluding remarks are given 
in Section 6. 

2 Preliminaries 

In this section, we recall some basic definitions from graph theory ^, stochastic matrices [2] 
and Bernoulli trials [1]. 

2.1 Directed Graphs 

A directed graph (digraph) Q = (V, £) consists of a finite set V of nodes and an arc set f C V x V. 
An element e = {i,j) G £^ is called an arc from node i E V to j S V. If the arcs are pairwise 
distinct in an alternating sequence ?;oeiUie2t'2 • • • etVk of nodes f j G V and arcs ej = (wj_i, u j) E £ 
for i = 1,2, . . . ,k, the sequence is called a (directed) path with length k. A path with no repeated 
nodes is called a simple path. A path from i to j is denoted as i —^ j, and the length of i — > j 
is denoted as \i — )• j\. If there exists a path from node i to node j, then node j is said to be 
reachable from node i. Each node is thought to be reachable by itself. A node v from which 
any other node is reachable is called a center (or a root) of ^. A digraph Q is said to be strongly 
connected if it contains path i ^ j and j —> i for every pair of nodes i and j, and quasi- strongly 



connected if G has a center [3] . 

The converse graph, Q of a digraph Q = (V, £"), is defined as the graph obtained by reversing 
the orientation of ah arcs in £. The distance from i to j in a digraph ^, d{i,j), is the length 
of a shortest simple path i ^ j if j is reachable from i, and the diameter of (y is diam(^)= 
max{d{i,j)\i,j G V, j is reachable from i}. 

The union of two digraphs with the same node set Qi = (V, £i) and G2 = (V, £^2) is defined as 
Qi^Q2 = (V, £'1 U£'2); we denote ^1 C Q2 if £^1 C £^2- A digraph Q is said to be bidirectional if for 
every two nodes i and j, (i, j) G iS if and only if (j, i) G £^ . A bidirectional graph Q is said to be 
connected if there is a path between any two nodes. A digraph Q is said to be weakly connected 
if it is connected as a bidirectional graph when all the arc directions are ignored. Strongly or 
quasi-strongly connected digraphs are hence always weakly connected. 

2.2 Stochastic Matrices 

A finite square matrix M = [rriij] G E,"^" is called stochastic if rriij > for all i,j and ^ ■ ruij = 1 
for all i. For a stochastic matrix M, introduce 

6{M) = maxmax|mQ,j — 7Ti^,|, A(M) = 1 — min > min{mQ,,-,m^,}. (1) 

j a,l3 a.j3 ^ — ' 
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If \{M) < 1 we call M a scrambling matrix. The following lemma can be found in [16]. 

Lemma 1 For any k > 1 stochastic matrices Mi , . . . , M^ , 

k 
6{Mk---M2Mi)<'[[XiMi). (2) 

i=l 

A stochastic matrix M = [rriij] G H"'^" is called doubly stochastic if also M^ is stochastic. 

Let P = [pij] G K,"^" be a matrix with nonnegative entries. We can associate a unique 
digraph Gp = {V,£p} with P on node set V = {1, . . . ,n} such that {j,i) G £'p if and only if 
Pij > 0. We call Gp the induced graph of P. 

2.3 Bernoulli Trials 

A sequence of independently distributed Bernoulli trials is a finite or infinite sequence of inde- 
pendent random variables QSq, 5Si, *B2, • • • , such that 

(i) For each A; > 0, 55^ equals either or 1; 



(ii) For each A; > 0, the probabihty that 53^ = 1 is pk- 
We call Pk the success probability for time k. The sequence of integers 

0<Ci<C2<---: *B^™ = 1 (3) 

is called the Bernoulli (success) sequence associated with the sequence of Bernoulli trials with 
Cm marking the time of the m'th success. 

3 Problem Definition 

In this section, we present the considered network model and define the problem of interest. 

3.1 Node Pair Selection Process 

Consider a network with node set V = {!,..., n} {n > 3). Let the digraph Qq = (V, <?o) 
denote the underlying graph of the considered network. The underlying graph indicates potential 
interactions between nodes. We use the asynchronous time model introduced in [12] to describe 
node interactions. Each node meets other nodes at independent time instances defined by a 
rate-one Poisson process. This is to say, the inter-meeting times at each node follows a rate-one 
exponential distribution. Without loss of generality, we can assume that at most one node is 
active at any given instance. Let Xi{k) G R denote the state (value) of node i at the fc'th meeting 
slot among all the nodes. 

Node interactions are characterized by an n x n matrix A = [aij], where aij > for all 
i,j = 1, . . . ,n and aij > if and only if {j,i) € Sq- We assume A is a stochastic matrix. The 
meeting process is defined as follows. 

The node pair selection process for the gossip algorithm is defined as follows. 

Definition 1 (Node Pair Selection Process) At each time k > 0, 

(i) A node i ^ V is drawn with probability 1/n; 

(ii) Node i picks the pair {i,j) with probability Oij. 

Note that, by the definition of the node pair selection process, the underlying graph Qq is 
actually the same as Qa, the induced graph of the node pair selection matrix A. For Qq, we use 
the following assumption. 



Al. (Weak Connectivity) The underlying graph Gq is weakly connected. 

Remark 1 Node that, node pairs {i,j) and {j,i) have different meaning according to the node 
pair selection process. When {i,j) is selected, node i is the node first picked and then i picks j. 
While pair (j, i) is selected means j is first picked who picks i later. 

Remark 2 The node pair selection matrix A being a stochastic matrix has natural meaning that 
node i's decisions form a well-defined probability space when it is selected at time k. However, 
it is not essential for the following discussions in the sense that all the results still stand even 
this assumption is replaced by the condition that each row sum of A is no larger than one. The 
same assumption is made in JT^ \4^ - 

Remark 3 Let Qj^^j^t denote the induced graph of matrix A + A^ . Apparently A + A^ is 
symmetric, and thus Qy^j^y^r is a bidirectional graph. It is not hard to see that Go is weakly 
connected if and only if Ga+a^ ^^ a connected bidirectional graph. 



Remark 4 In the standing assumption of ]1^ - the matrix A is supposed to have its largest 
eigenvalue equal to 1 and all other n — 1 eigenvalues strictly less than 1 in magnitude. This 
condition is equivalent with that Go is quasi- strongly connected 1281 \25^ . On the other hand, in 
IJSil , Go is assumed to be strongly connected. Therefore, Assumption 1 is a weaker assumption, 
compared to the one in fl^ [J^ . 

Remark 5 In order to guarantee convergence for the gossip algorithm discussed below, Al can- 
not be further weakened based on the following argument. Let us just assume Al does not hold 
true. Then there will be two disjoint node sets Vi, V2 C V such that there is no link connecting 
the two sets. As a result, nodes in Vi,i = 1,2 can only communicate with nodes belonging to 
the same subset. Therefore, the network is essentially divided into two isolated parts, and a 
convergence for the whole network is thus impossible. 

3.2 Node Communication Process 

When pair {i,j) is selected, both nodes try to set their states equal to the average of their 
current states. To this end, two communication links with opposite directions are established 
between the two nodes. 

Let {P^}'o^ and {-P^ljf be two deterministic sequences with < P^,P^ < 1 for all k. The 
node communication process is defined as follows. 



Definition 2 (Node Communication Process) Independent with time, node states, and the node 
pair selection process, 

(i) P(E^) = P^ with E^ = {^node i receives Xj{k) when {i,j) is selected at time A;|; 

(a) P(E^) = Pf^ with E^ = {^node j receives Xi{k) when {i,j) is selected at time A;}. 

If a node fails to receive the value of the other node, it will keep its current state. Note that 
we do not in general impose independence between i receiving Xj{k) and j receiving Xi{k) when 
pair {i,j) is selected. In fact, we will study how such potential dependence in the communication 
process influences the convergence of the gossip algorithm. 

Remark 6 A randomized gossip algorithm can also be viewed as belief propagation in a social 
network, where Xi{k) represents the belief of node i. Then the communication process naturally 
captures the loss of 'trust ' when two nodes meet and exchange opinions 14 3[ \44^ - Therefore, from 
a social network viewpoint, the discussion in this paper on the convergence property of the gossip 
algorithm establishes the influence of missing 'trust' in belief agreement. 

3.3 Problem 

Let the initial condition be x^ = x^ko) = (xi(A;o) . . . Xn{kQ)Y' £ IK-") where fco ^ is an arbitrary 
integer. According to the node pair selection process and the node communication process, the 
iteration of the gossip algorithm can be expressed as: for k > ko, 



Xi{k + 1) 



(4) 



^Xi{k) + ^Xj{k), if A^^*'-?') happens 
Xi{k), otherwise, 

where 

■Mp'' = \ {i,j) is selected or {j,i) is selected, and i receives Xj{k) at time k > 

denotes the event that node i successfully updates at time k. According to the definitions above, 
we have 

y K J ^ K ^ K y K J ^ K ^ K 

Therefore, the two events, M^.' and Mj! , are not necessarily symmetric in their probabilities, 
due to the potential asymmetry of the meeting matrix A. 



In this paper, we study the convergence of the randomized gossip consensus algorithm and 
the time it takes for the network to reach a consensus. Let 

x{k;ko,x°) = [xi{k;ko,xi{ko)) . . .Xn{k;ko,Xn{ko))j G E" 

be the random process driven by the randomized algorithm ([3]). When it is clear from the 
context, we will identify x(k;kQ,x^) with x{k). 
Denote 

H{k) = max Xi{k), h{k) = min Xi{k) 

i=l,...,n i=l,...,n 

as the maximum and minimum states among all nodes, respectively, and define T-L{k) = H{k) — 
h{k) as the consensus metric. We introduce the following definition. 

Definition 3 (i) A global a.s. consensus is achieved if 

P( lim nik) = 0) = 1 (5) 

A;— ^-oo 

for any initial condition x^ G R". 
(ii) Let the e- computation time be 

Then a global a.s. e-consensus is achieved if 

Tcom(e) = 0(loge"^) (7) 

where by definition /(e) = 0((7(e)) means that limsup^^g /(^)/5'(^) < oo is a nonzero constant. 

Remark 7 A global a.s. only requires that T-L{t) will converge to zero with probability one. If it 
is further required that the convergence speed is sufficiently fast, we use global a.s. e-agreement. 
The e-computation Tcom(e) is essentially equivalent with the definition of e-averaging time in 
Jig) /, which is a probabilistic version of similar concepts used to characterize the convergence 
rate of deterministic consensus algorithms in the literature, e.g., \4^ . 

Recall that until now, when i is selected to meet node j at time k, no assumption has been 
made on the dependence between the communication from i to j, and the one from j to i. In 
the following two sections, we will discuss the convergence of the considered randomized gossip 
algorithm with perfectly dependent and independent communication, respectively. We will show 
that the dependence in the node communication plays a critical role in determining the behavior 
of the gossip algorithm. 



W.)^™p,nf{t-.„: P(m>.)<.}. (6) 



4 Convergence under Perfectly Dependent Communication 

In this section, we study the case when the communication between nodes i and j is perfectly 
dependent, as described in the following assumption. 

A2. (Perfectly Dependent Communication) The events E^ = E^ except for a set with proba- 
bility zero for all k. 

Note that A2 is equivalent to assuming that P(Ej^|E^) = P(E^|E^) = 1. Hence, we have 
P^ = P^ and at each instant, with probability P^ = P^ = Pj7 , both E^ and E^ occur, and 
with probability 1 — Pk they both fail. With A2, the gossip algorithm can be expressed as 

x{k + l) = W{k)x{k), (8) 

where W(k) is the random matrix satisfying 

p(H^(fc) = T4^^,,)=/-i^i^^il^l^^)=^^I±^P,, ^^j (9) 

with Cm = (0 . . . 1 . . . 0)"^ denoting the nx 1 unit vector whose ?7i'th component is 1. Moreover, 
p{W{k) = Wi^u) =/) = !- E^>j "-"^Pk. 

The main result on a.s. consensus for the considered gossip algorithm under perfectly de- 
pendent communication is stated as follows. 

Theorem 1 Suppose Al (Weak Connectivity) and A2 (Perfectly Dependent Communication) 
hold. Global a.s. consensus is achieved if and only ifYl^=oPk = oo. 

Denote D = diag((ii . . .d„) with di = Yll=ii'^ij + '^ji)- ^^^ ^-S- e-consensus, we have the 
following conclusion. 

Theorem 2 Suppose Al (Weak Connectivity) and A2 (Perfectly Dependent Communication) 
hold. Clobal a.s. e-consensus is achieved if and only if there exist a constant p^ > and an 
integer T* > 1 such that X]fcLm*~ ^k ^ P* for all m > 0. In fact, we have 

rcon.(e)<3[log(l-^)-']"'loge-i + 0(l), (10) 

where Ag is the second smallest eigenvalue of D — {A + A^). 

Theorem [1] indicates that YlV=oP'k = oo is actually a threshold for the gossip algorithm to 
reach a.s. consensus. For e-consensus. Theorem [2] implies that X^^q-^^ ~ 0{m) is the threshold 
condition, which actually requires that X^fcLo ^k grows linearly as a function of m. 

10 



Remark 8 TheoremsUl and\^ rely on the fact that there are at least three nodes in the network. 
If the network contains only two nodes, then both Theorems [1\ and IM no longer hold true. This 
phenomenon is interesting since many consensus results in the literature are independent of the 
number of nodes, e.g., JM\M\M\M\M \E^- 

Let the random variable S,{ko,x^) denote the consensus Umit (supposed to exist), i.e., 

lira Xi{k) = ^, a.s. i = l,...,n. (11) 

fc— >oo 

Denote Xave = Y17=i ^i(^o)/'^ be the average of the initial values. Then the following conclusion 
holds showing that the average is preserved almost surely with perfectly dependent communi- 
cation. 

Theorem 3 Suppose Al (Weak Connectivity) and A2 (Perfectly Dependent Communication) 
hold. Then for all initial conditions x^ = x{ko) G H", we have 

n 
P(^x,(fc) = nXave, fe > fco) = 1- (12) 

i=l 

Consequently, we have P(C = a^ave) = 1 whenever the consensus limit exists. 

In the following two subsections, we will present the proof of Theorems [1] and [21 respectively. 
Theorem [3] follows from the proof Theorem [TJ 

The upcoming analysis relies on the following well-known lemmas. 

Lemma 2 Suppose < bk < I for all k. Then J2T=o ^k = oo if and only i/n^Q(l — bk) = 0. 
Lemma 3 log(l - t) > -2t for all < t < 1/2. 

4.1 Proof of Theorem [1] 

(Sufficiency.) This part of the proof is based on a similar argument as is used in |12] . Define 
^{^) — Z]r=i |a^i(^) ~ a^aveP) where | ■ | represents the Euclidean norm of a vector or the absolute 
value of a scalar. 

It is easy to verify for every possible sample and fixed instant k that VF(ij) of the random 
matrix W{k) defined in ^ and © fulfihs 

(i). l^(ij} is a doubly stochastic matrix, i.e., W/^(ij}l = 1 and l-^H^^jj^ = l'^; 

(n). Wi^ij) is a projection matrix, i.e., Wi^ij) = Wj^-s^Wi^ij). 

11 



Therefore, we have 

e(l(A: + 1) |x(A:)) = e((x(A; + 1) - x^^^)^ {x{k + 1) - Xavel) |x(A;)) 

= E(^{W{k)x{k) - x^,,l)^{W{k)x{k) - xavcl) \x{ky^ 
= e((x(A;) - x,,,lfw{kfWik){x{k) - Xavel) \xik)] 

= {x{k) - X^,,lfE{W{kfW{k)) {x{k) - Xavel) 

= {xik) - Xavel)^E(P^(A:)) {x{k) - Xavel) (13) 

Since W{k) is doubly stochastic, we know that the sum of the nodes' states, ^^=iXi{k), 
is preserved with probability one, and 1 is the eigenvector corresponding to eigenvalue 1 of 
E(W{k)) (Theorem [3] therefore holds). Thus, we can conclude from (J13p that 

E(L(A: + l)|x(fc)) <A2(E(t^(A:)))(x(A;)-Xavel)'^(x(A:)-Xavel) = A2(E(t^(fe)))L(A:), (14) 

where X2{M) for a stochastic matrix M denotes the largest eigenvalue in magnitude except for 
the eigenvalue at one. Here note that E[W{k)^ is symmetric and positive semi-definite. 
Now according to ([9]), we see that 

E{W{k))=I-^{D-{A + A^)). (15) 

Note that D — (A + A^) is actually the (weighted) Laplacian of the graph ^^_|_^t. With assump- 
tion Al, Qji^j^t is a connected graph (cf.. Remark [3|), and therefore, based on the well-known 
property of Laplacian matrix of connected graphs [3], we have A2 > 0, where A2 is the second 
smallest eigenvalue of D — (^4 + A ). On the other hand, since yl is a stochastic matrix, it is 
straightforward to see that 

y ttij + ttji < n (16) 

for alH = 1, . . . , n. According to Gershgorin circle theorem, all the eigenvalues of -D — (j4 + A ) 
are bounded by 2n. Therefore, we conclude from (llSp that for all k, 

X,(E{W{k)))=l-^Pk. (17) 

With ([n]) and ([IT]), we obtain 

E(L(fc + l)) < n X2(E{W{{}))Liko) = n (i-^P,)l(A:o), (18) 

i=ko i=ko 

12 



Therefore, based on Lemma [2] and Fatou's lemma, we have 



E Hm L{k)) < Mm B(L{k)) =0, (19) 

where hm^^oo L{k) exists simply from the fact that the sequence is non-increasing. This imme- 
diately implies 

P( lim Xi(fc)=Xave) =1. (20) 

fc— >-oo 

The sufficiency claim of the theorem thus holds. 

(Necessity.) From the definition of the gossip algorithm, we have 

n 

P[xi{k + 1) = Xi{k)) > 1 — P[i receives Xj{k)) ■ y, P(pair {i,j) is selected) 





i=i, 


j¥'i 


+ P(pair {j,i 


) is selected) 




n 


1/ N 

-[aij + aji) 




= 1 - KPk, 







(21) 



where hi = "^j^i j^i ^{aij + aji) , i = 1, . . . , n. Noting the fact that 



n n ^ 



Y.^' = Y. Yl -{aij+aji)=2-^aii<2, (22) 



n 

i=l i=l j=l, jj^i i=l 



there exists at least one node ai G V such that hai < 1 since n > 3. Moreover, assumption Al 

further guarantees that hi > 0,i = 1, . . . ,n, which implies that there exists another node 02 &V 

such that /laj < 1- 

Therefore, if X^^^q Pk < 00, we have 

00 
F{xaAk)=XaAko),k>ko)= l[{l-ha,Pk)=(Ti>0, i = 1, 2 (23) 

k=ko 

based on Lemma [2j Consequently, choosing Xai{ko) 7^ Xaiiko), consensus will fail with proba- 
bility (Ti(T2 > 0. This completes the proof. 

4.2 Proof of Theorem [2] 

(Sufficiency.) Recall that 7i{k) = maxj=i^...^„ 2;j(A;) — minj=i^...^„Xj(A;). Suppose nodes a and /3 
reach the maximum and minimum values at time k, respectively, i.e., 

Xaik) = max Xi{k); xp{k) = min Xi{k). 

i=l,...,n i=l,...,n 

13 



Then we have 



Lyk) — y ^ \Xi\k) Xavel ^ \XoL\k,) Xavel + l-^/Jv"-) -^a 



i=l 



On the other hand, 

n ^ n n 

L{ko) = ^\xi{ko) -2;avcP < —\^^\nxi{ko) - ^Xj{k^ 

1=1 i=l j=l 

i=l j=l, j^i 

< ^^^n\ko). 

n 
With (|24p and (|25p and applying Markov's inequality, we have 



n{ko) - . 



ynHko) - 

„ / L(k) n 

<P -4^ > 



L{ko) - 2(n-l) 
^ 2in-l) ^,,B{L{k)) 



n 



^ 2(n-l) ^_2 



n 



L{ko 

fc-i 

J=fco 



A^ 



2n 



>m+T*-l 



(24) 



(25) 



(26) 



where the last inequality holds from ([TH]) . Since X^^!^* ^ -Pfe > p* for all m > 0, according to 
the arithmetic mean-geometric mean inequality, we have that for all m > 0, 



m+T*-l 



n 1 



AS 



fc=m. 

As a result, we obtain 



2n 



T — -2. \^ 



Ao v^m+T*-l 



Pt\7^. 



fc=m ^'feV* / f^ ^2P* 



< 1 



2nr* 



c* < 1. 



(27) 



fc-i 

n(i-afi 



2n' 






''"% _1 



(28) 



where [zj denotes the largest integer no larger than z. 
Then (I2SD and {I2HD lead to 



/^^ X 2(n-l)^_2^^ 

v?^(fco) - ; - n 



a_i 



-e c* 



(29) 
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which imphes 



J- com I € J ^ J- ^ 



31oge i+log^^^^ 



log c* ^ 
The desired conclusion follows. 



log(l-^)"']"'loge-i + 0(l). (30) 



(Necessity.) We prove the necessity part of Theorem [5] by a contradiction argument. Let ai, 02 
be defined as in the proof of Theorem [TJ Set Xa^{kQ) = 0, Xq,2(^o) = 1 and Xj{ko) G [0, 1] for all 
other nodes. Then according to (|23p. we have 

P(:^^>e) >P(x«,(t) = a:a,(A:o), ^ = 1, 2; fco < t < A:) = J] (l - /i„^p,) (l - /j„^p,) 

(31) 

Take e = 1/^ with ^ = 1,2,... . Suppose suitable p* and T^, cannot be found such that 
YlT=m*" ^k > P* for all TO > 0. Then for any T = £logi, there exists an integer A; > such 
that Yl -t. Pt < 1/2. According to (|3ip and Lemma [3l we have 



k+T 

nik) 



^ C^^'^I^^'^ ^^) ^ n (i-/^..^o(i-/^«.^*) 



t=k 

= gE^i* (l0g(l-hciPt)+l0g(l-/lc2Pt)) 

> 1/^ (32) 

for all £ > e'*"i+'^"2. This immediately imphes Tcom(l/^) > ^log£, which suggests that Tcom(e) = 
0(loge^^) does not hold. The proof has been completed. 

5 Convergence under Independent Communication 

In this section, we focus on the case when the communication between nodes i and j is inde- 
pendent, as described in the following assumption. 

A3. (Independent Communication) The events E^ and E^ are independent for all k. 

Remark 9 Symmetric and asymmetric randomized gossip algorithms were studied in 133]. The 
symmetry in 133] is a deterministic concept where the gossip algorithm is either symmetric or 
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asymmetric, and therefore, it is binary. While the dependence discussed in this paper carefully 
characterizes how much symmetry is missing from a probabilistic viewpoint. The symmetric 
model in JSSf is a special case of ours when P^ = P^ = 1 for all k (in this case perfectly 
dependent communication coincides with independent communication). The asymmetric model 
in f33f is a special case of our independent communication model with P^ = 1 and P^ = for 
all k. 

With A3, E||, Eq", E]^, E]^, ... is a sequence of independent events, and the considered gossip 
algorithm can be expressed as 

x{k + l) = W(k)x(k), (33) 

where W{k) is the random matrix satisfying 

(i) p(H^(fc) = /-M£i_£2)I) = !^P+(l_p-) + ^p-(l_P+), i^j. 

(ii) P(w{k) = I - (£iz£2fc£i)I) = !hi±22ip+p~^ i > j. 



(iii) P(w{k) = /) = !- ^^^^. ^^(p+ + P- - P+P^ 

In order to estabhsh the convergence results under independent communication, we need the 
following condition for the underlying connectivity. 

A4. (Double Connectivity) Both the underlying graph Qq and its converse graph Q^ are quasi- 
strongly connected. 

Remark 10 Note that the condition of Qq being strongly connected implies A4, but not vice 
versa. Moreover, it is not hard to see that Go = Qa, and Qq = Q^r, where Qa and Gat are the 
induced graph of A and A^ , respectively. 

We now present the main result on a.s. consensus under independent communication. 

Theorem 4 Suppose A3 (Independent Communication) and A4 (Double Connectivity) hold. 
Global a.s. consensus is achieved if and only ifJ2'h=o(^k ^ ^k) ~ °^- 

Denote d* = max {diam(CyA),diam(^A.T)}, where diam(tyA) and diam(Cy^T) represent the 
diameter of the induced graph of A and A^ , respectively. Take £'* = !^o| — Z^ILi ^Sii('^««)' 
where |£'o| represents the number of elements in 8q, and sgn(2;) is the sign function. Introduce 
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a* = minjajj : aij > 0, i,j = l,...,n, i 7^ j} as the lower bound of the nonzero and 
non-diagonal entries in the meeting probability matrix A. For a.s. e-consensus, the following 
conclusion holds. 

Theorem 5 Suppose A3 (Independent Communication) and A4 (Double Connectivity) hold. 
Global a.s. e-consensus is achieved if and only if there exist a constant p* > and an integer 
r* > 1 such that X^^=^~ {Pjl + PjT) > p* for all m > 0. In this case, we have 

iog(i-fe)''T 

where Oq = (2d* - 1){2E^ - 1). 



Tcomie) < ^ ^^^"^'^y* ^^ log6-i + 0(l), (34) 



We also have the following conclusion indicating that the expected value of the consensus 
limit ^ equals the initial average Xavc if P^ — Pk • 

Theorem 6 Suppose A3 (Independent Communication) holds and the consensus limit exists. 
Then E(0 = Xave «/ P^ = Pk for a^ k. 

With independent communication, when P^ = P^ for all k, it is not hard to see that 
for any k = 0,1,..., E(W{k)) is a doubly stochastic matrix since it is both stochastic and 
symmetric. Thus, Theorem [6] holds trivially. Furthermore, we have another conclusion showing 
that whenever the consensus limit ^ exists, the average can almost never be preserved. 

Theorem 7 Suppose A3 (Independent Communication) holds and the number of nodes, n, is 
odd. Assume that Pj^ , P^ € [0, 1 — e] for all k > with < e < 1 a fixed number. Then for any 
ko > and for almost all initial conditions x^ = x^ko) € II", we have 

n 

PJy^^Xiik) = nxave, k>ko]=0 (35) 

if the consensus limit ^ exists. 

Remark 11 Surprisingly enough Theorem^ relies on the condition that the number of nodes is 
odd. This is not conservative in the sense that we can easily find an example with even number 
of nodes such that P( X^ILi •^«(^) ~ ^^ave; k > /cq) > conditioned the consensus limit exists. 

Remark 12 Note that perfectly dependent communication coincides with independent commu- 
nication when P^ = P^ = 1. This is why we need P^ and P^ to be at a distance from one in 
the assumption of Theorem^ 
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Regarding the non-conservativeness of A4 (Double Connectivity) to ensure a consensus under 
independent communication, we have the following conclusion. 

Proposition 1 Suppose A3 (Independent Communication) holds. Then the condition X]fc^o(-^fc'~'~ 
PfT) = oo always ensures an a.s. consensus only if A^ (Double Connectivity) holds. 

Proof. Assume that A4 does not hold. Then either Qq or Qq is not quasi-strongly connected. 

Let us first discuss the case when Qq is not quasi-strongly connected. There will be two 
distinct nodes i and j such that VinV2 = 05 where Vi = {nodes from which i is reachable in Qq} 
and V2 = {nodes from which j is reachable in ^o}- The definition of the node pair selection 
process then implies 

(m: a^fc >0,A: E Vr} C Vr, r = l,2. (36) 

Take P^ = for all k. Then each node in Vt,t = 1,2 can only be connected to the nodes in 
the same subset. Consensus will then fail with probability one even with X^fc^o ^k = 00 if we 
just take Xmik^) = for m G Vi and Xmi^o) = 1 for m G V2- 

Similar analysis leads to the same conclusion for the case with Qq not quasi-strongly con- 
nected. This completes the proof. D 

Remark 13 It was shown in Remark [5| that Al (Weak Connectivity) is a lower hound for 
the underlying connectivity ensuring a consensus, and Theorem [1\ indicated that under Al, the 
condition YlT=o(^k + ^k) ~ °^ ensures an a.s. consensus under independent communication. 
Therefore, combining Remarkl^ Theorem\^ Theorem^ and PropositionUl we can conclude that 
in terms of consensus convergence of the randomized gossip algorithm, Al (Weak Connectivity) 
is critical for perfectly dependent communication, as is A4 (Double Connectivity) for independent 
communication. 

Remark 14 For perfectly dependent communication, Theorem shows the consensus limit 
equals the initial average with probability one. While for independent communication, we see 
from Theorem that only the expected value of the consensus limit equals the initial average 
with an additional condition P^ = P^ . 

Remark 15 We see from il2\} and i35\) that with odd number of nodes, average preserving 
turns from almost forever (with probability one for all initial conditions) with perfectly dependent 
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communication, to alm,ost never (with probability zero for almost all initial conditions) for the 
independent case. As has been shown widely in the classical random graph theory JQ Q \^, the 
probability of a random graph to hold a certain property often jumps from one to zero when a 
certain threshold is crossed. Now we can conclude that communication dependence provides such 
a threshold for average preserving of the considered randomized gossip algorithm. 

With independent communication, we have W{k) = I — "^^ ^'^'^^ with a nontrivial probabil- 
ity in (j33p . The matrix / — ^ ^"^^ is neither doubly stochastic nor symmetric, and it is not a 
projection matrix. These properties are critical for the convergence analysis in Theorems [T] and 
[2] to stand. Hence, we need new methods to analyze the convergence property under assumption 
A3. 

The rest of this section is organized as follows. In Subsection 5.1, we establish several useful 
lemmas for the convergence analysis. Proofs of Theorems [H [5] and [7] are given in Subsection 5.2. 

5.1 Key Lemmas 

In this subsection, we first establish an important property of the Bernoulli trials defined by 
the node communication process. Then, we investigate the product of the stochastic matrices 
derived from the gossip algorithm. 

5.1.1 Bernoulli Communication Links 

Define two (independent) sequences of independent Bernoulli trials 

*R+ fR+ fR+ 

' 1 ' 2 1 • • • ' 
■^0 5 ^i 5 ^2 ' • • • ; 

such that P(«B+ = l) = P^ and P(*B^ = l) = Pk ■ Then let 

5So,5Si,5S2,..., 

denote the independent Bernoulli trials given by 53^ = 1 if and only if 5S^ + 5S^ > 1. 

Clearly 55^ = 1 characterizes the event that at least one side of the communication succeeds 
at time k. In order to establish the success times property of the process of {5Sfc}o°, we need 
the following lemma. 

Lemma 4 Suppose {bk},{ck} are two sequences satisfying b^^Ck € [0,1], k = 0,1,.... Then 
Y^kLo {h + Cfc - bkCk) = oo if and only if ^^g (6^ + Cfc) = oo. 
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Proof. It is straightforward to see that 

^{h + Cfc) < max{6fc, Cfc} < bk + Ck - hkCk < bk + c^. (37) 

Then the desired conclusion fohows trivially. D 

The following lemma holds on the success times of {*Bfc}[^. 

Lemma 5 P(/or all k^ > 0, 03,^ = 1 for infinitely many k > /cq) = 1 ^/ O'lT-d only ifYlT=o(^k ~^ 

Pk) = ^■ 

Proof Note that, we have P(5Sfc = l) = 1 - (1 - P+)(l - P^) = P+ + P^ - P+P^ = Zk. Then 

P{^k = 0,k>ko) = l[{l-Zk), (38) 

k=ko 

and the necessity part of the conclusion follows immediately from Lemmas [2] and HI 

On the other hand, since "^kLoiPjl + ^k) — ^^ implies Y[h=s (^ ~ ^k) = for all s > 0, we 
have 



P(3m > 0, 5Sfc = 1 for finitely many k > nij < 2, P(^fc = 1 for finitely many k > rnj 

m=0 

oo oo 

< ^ ^ P(«fc = for A: > s) 

771=0 s=m 

oo oo oo 

771=0 s=mk=s 

= 0. (39) 

This leads to P(for all /cq > 0, 25^ = 1 for infinitely many k > /cq) = 1- We have now completed 
the proof. D 

5.1.2 Products of Transition Matrices 

The considered gossip algorithm is determined by the possible samples of the transition matrix 
W{k). Denote M = Mi U Ms with 



il 



I - ' \ ^ — ■ o^ij + aji > 0, i,j = l,...,n, i^ jj 



and 



112 



/ 



V^i (^j){(ii Cjj 



: aij +aji>0, i,j = 1,... ,n, i> jj. 



2 
Then M is the set including all samples of W{k) except for the identity matrix /. The following 

lemma holds on the product of matrices from M. 
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Lemma 6 Let Mk £ M, k = 1, . . . ,N be N > 1 matrices in M. Then we have (Uili ^A/,) Q 
Qmm-Mi- Moreover, all nonzero entries of the product Mp^ ■ ■ ■ Mi have lower hound 2~^ . 

Proof. We just prove the case for N = 2. Then the conclusion fohows immediately by induction. 

Note that the nonzero entries of any matrix in M is no smaller than 1/2. Moreover, all 

matrices in M have positive diagonal entries. Therefore, denoting rhij, rhij and m^, as the 

ij-entries of Mi, M2 and M1M2, respectively, we have 

n 
'^iii2 = Yl '^hjiTT'ji2 > ^iii2/2 + rni^hfi- (40) 

This implies rn*^^^ > as long as at least one of mj^jj ^''^^ ^iii2 is non-zero. Furthermore, if 
m*^^2 > 0, it is straightforward to see that m*_^-^ > 1/4. This completes the proof. D 

Recall that d* = max {diam(^A),diam(^AT)}, where diam(^A) and diam(t7^T) represent 
the diameter of the induced graph of A and A , respectively. We have the following lemma. 

Lemma 7 Suppose A4 (Double Connectivity) holds. Let Mi, . . . , M2d,-i be 2d* — 1 products of 
some finite matrices in M. If Qa ^ Gmi, '^'^ Sat — Sm^ holds for any k = 1, . . . , 2(i^, — 1, then 
M2dt-i ■ ■ ■ -^1 ^s ffl scrambling matrix. 

Proof. Since for any k, we have Qa ^ 0M^. or Q^^t C Gm^, there must be one of Qa ^ QaIj, °^ 
Q^T C Qpf^ happens at least d* times for k = 1, . . . , 2d* — 1. Without loss of generality, we 
just focus on the case that Ga ^ Givik happens at least d* times. Thus, there exist 1 < ki < 
k2 < • • • < kd, < 2d* — 1 such that Qa ^ Smj, > s = 1, . . . ,d*. We now separate the product 
M2d,-i • • • Ml into the product of d* matrices: 

Ml = Mk, ■ ■ ■ Mi; Mj = Mk^ ■ ■ ■ Mk^_^+i,j = 2, . . . , d* - 1; M^, = Msd.-i • • • M^^^^+i. 

Then M2d,-i ■ ■ ■ Mi = Md, ■■■Mi. Since Qa C Qm^, for each kg, we have Qa C Qm^^ ^ ^Ms^^ = 
1, . . . , d* directly from Lemma [6l 

Suppose io is a center node of Qa- Then for any other node jo, there must be a path zq -^ jo 
in Qa with length no larger than d*. We assume d(zo, jo) = do < d*. We redenote jo as i^g, and 
let ioco^i ■ ■ ■ ^do'^'do be a shortest path from zq to id^ in the graph Qa- 

- - (s) - - 

We denote the ij-entry of Mg as M-- for s = 1, . . . , d*. Also denote the ij-entry of M2M1 
as Mj^- . The fact that {io,ii) £ £a ^ £mi immediately implies M^J- > according to the 

_ i2\ 

definition of induced graph. Similarly we have M^^l^ > because (^1,^2) € ^A ^ ^M2' Thus, we 
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obtain 

n 

MJ^l^ = y M^ImH^ > Mfl M^ > 0. (41) 

T=l 

Similar analysis can be proceeded until we eventually obtain 

^{do>...(i) Q .^2) 

where iW^ ° denotes the vzo-entry of M^q ■ ■ ■ Mi. 

Denote the rio-entry of M2d^,-i ■ ■ ■ Mi as M*^^. Noting that ([12|) holds for arbitrary i^^, we 
see from Lemma [6] that 

Therefore, according to the definition of A(-) in ([T|), obtain 

X{M2d,-i ... Ml) < 1 - min M/,,, < 1. 

j = l,...,n -^ 

We have now proved the conclusion. D 

/ — — — ^^^ — ^-'— : i,j = l,...,n, i > j >. The following lemma 
is on the impossibility of the finite-time convergence for the product of matrices in M2 . 

Lemma 8 Suppose n is an odd number. Take matrices M^- € M2, t = 1, . . . ,k,k > 1 arbitrar- 
ily. Then we have 5{Mk • • • Mi) > 0, where 5{-) is defined in ^\). 

Proof. We prove the lemma by a contradiction argument. Suppose there exist an integer /c* > 1 
and fc* matrices Mr € Mg, r = 1, . . . ,k^: satisfying S{Mk^, ■ ■ ■ Mi) = 0. Therefore, there exists 
/? € R*^ as an n X 1 vector such that 

Mfc, . . . Ml = 1/3^, (44) 

which implies 

Mk,---Miy = ip'^y = zol (45) 

for all y = {yi . . . yn)'^ € El", where zq = p'^y is a scalar. Since M,- G Mj, r = 1, . . . , /c*, the 
average of yi, . . . , y„ is always preserved, and therefore, zq = XlILi y*/"" 

Let n = 2no + 1 since n is an odd number. Take yj = for i = 1, . . . ,no and yi = 2 *^^ 
for z = no + 1, ... , 2no + 1. Then zq = 2^*^^{nQ + l)/(2no + 1). On the other hand, it is 
straightforward to verify that each element in M^^ . . . Miy can only be an even number, say, Sq. 
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Since 5*0 is even, we have 5*0 = 2"S'* for some < a < /c* an integer and 5"* an odd number. 
This leads to 

which imphes 

2fc.+i-a(^^ + 1) = (2no + 1)5*. (47) 

Clearly it is impossible for ()47p to hold true since its left-hand side is an even number, while 
the right-hand side odd. Therefore such Mj- G Mg, r = 1, . . . ,k^ does not exist and the desired 
conclusion follows. D 

5.2 Proofs 

We are now in a place to prove Theorems HI [5] and [71 

5.2.1 Proof of Theorem H 

(Necessity.) We have 

P[xi{k -|- 1) = Xi{k)) > 1 — y^ P(pair (j, i) is selected) • P(i receives Xj{k)) 

+ P(pair {i,j) is selected) • P(i receives Xj{k)) 



>l-/iimax{P+,p-} (48) 



where hi is introduced in ([2T|) . Note that, from ([ST]) . we know X]fclo(^^ + ^k) = oo if and 
only if Yl'k^=o max {P/T , P^} = oo. Therefore, with ()48p . the necessity part of Theorem H] follows 
immediately from the same argument as the proof of the the necessity part in Theorem [TJ 

(Sufficiency.) Recall that the considered gossip algorithm is determined by the random matrix, 
W{k) in (p3|) . Denote the induced (random) graph of W{k) as Gw{k) = iy->^w{k))- Global a.s. 
consensus is equivalent to P(limfc^oo ^{^k ■ ■ ■ W2W1) = O) =1. 
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With independent communication, we have 

P( {i,j) G Sw{k) ^^ least one of *B^ and *B^ succeeds at time kj 



1-(1 


-n^)(l- 


Pk 


-) 


a.j 


Pk+<^^^Pk 






<P^ 


+ Pk-P^ 


Pk 


) 


^nPk 


+ <^^3Pk 







> '\\ '' ^ (49) 

for all i, J = 1, . . . , n, i ^ j. Here without loss of generality, we assume PjJ' + P^ > 0. 

Recall that a* = minjajj : aij > 0, z, j = 1, . . . , n, i / j} is the lower bound of the nonzero 
and non-diagonal entries in the meeting probability matrix A. Based on (|49p . there are two 
cases. 

(i) When Pj^ > PjT , for alH, j = 1, . . . , n, i ^ j with a^j > 0, we have 

Pf {j,i) G £w{k) at least one of 53^ and 5S^ succeeds at time A: j > — ^. (50) 

(i) When P^ < P^, for alH, j = 1, . . . , n, i j^ j with a^j > 0, we have 

P( (i, j) G £w{k) at least one of 55^ and 55^ succeeds at time A; j > -^. (51) 

Now we introduce the Bernoulli (success) sequence of 'Bq, 'Bi, ^2, • • • as 

< Ci < C2 < • • • : 5Sc^ = 1 

where Cm is the time of the ?7i'th success. Since J2T=o {Pk ^ Pk) ~ °°' Lemma [5] guarantees 
that C,rn < 00 a.s. for all m = 1,2,.... For simplicity, we assume /cq = in the rest of the proof. 
Prom (150]) and (fSTjl . for Ci < C2 < • ■ ■ j we can take a sequence of arcs (ii, j'l), (^2, J2)) • • • 
such that 

P((im,Jm) G ^Ty(u)) ^ ^' (52) 

where im 7^ Jm for all ?n, = 1, 2, . . . and either ai^j^ > or «■;■„«,„ > holds. Moreover, since the 
node pair selection process and the node communication process are independent for different 
instances, events {{im,jm) G £yy(^^^-^,m = 1,2,...} are independent. 
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Recall that i?* is the number of non-self- looped arcs in the underlying graph. From the double 
connectivity assumption A4, it is not hard to see that we can select (ii, ji), {i2,J2)i ■ ■ ■ , (^2_B*-i) 
J2E,-i) properly such that 

2E,-1 2E,~1 

Qa^ \J {{im,3m)] or QaT ^ [J {{im^jm)] 
m=l m=l 

holds. Thus, based on Lemma [H denoting Qi = VF(^2b*-i ' ' ' ^Ci' ^^ have Qa ^ Gqi or Qa^ ^ 

Similarly, denoting Qr = W'^(2b,_i), • • • ^C(2i5.-i)(r-i)+i for r = 1, 2, . . . , we have Qa Q Qq^ 
or Qat ^ Qq^ for all r. According to Lemma [71 we have 

^{x{Q2a.-.---Qr)<l)>[^r''-''''''-''. (53) 

Moreover, since Q2d,~i ■ ■ -Qi is a product of 9q = [2d^, — 1){2E^ — 1) matrices in M, Lemma [6] 
further ensures 



'(A(Q2d.-i---Qi)<l-2-^")>(J)'°. (54) 



2n' 

Continuing the analysis, we know that for all Fs = Q{2dt~i)s ' ' ' Q(2d*-i)(s-i)+i! s = 1, 2, . . . , we 
have 

pfAfF,)<l-2-^")>(|^)'«, s = l,2,..., (55) 



which implies 



e(a(F.))<1-(^)''\ . = 1,2,.... (56) 



With Fatou's lemma and Lemma [TJ we finally have 

m 

e( lim 6{Fm---Fi)) < lim e(5(F„ • • • Fi)) < lim E("rTA(F,)) =0, (57) 

s=l 

which implies 

Pf lim 5{Fm---F,)=0) =1. (58) 

Note that, ([55]) leads to P( limfc^oo ^^k ■ ■ ■ Wi) = O) = 1 since the definition of F^, s = 1, 2, . . . 
guarantees W^ = I for all k ^ {Ci, C2) • • • }• This completes the proof. 
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5.2.2 Proof of Theorem [5] 

Similar to the proof of Theorem [H the necessity part of Theorem [5] fohows from the same 
argument as the one used in the proof of Theorem [2j Here we just focus on the sufficiency 
statement of Theorem [5l For simplicity we assume /cq = 0. 

Denote the zj-entry of W{k — 1) • • • M^(0) as ^ij{k). Then for all i,j and a, we have 



Xi( 



i{k) - Xj{k)\ = I Y^ ^irn{k)x„,{Ql) - Y^ ^ jm{k)x.m{Q)\ 

m=l m=l 

n n 

= I Y ^im{k)ixm{0) - X„(0)) - Y ^jm{k){xm{0) - X„(0)) 
m=l m=l 

n 

< yi \'^im{k) - *jm(A:)| • max \xm{0) - Xa{0)\ 

m=l 

<n6{W{k-l)---W{0))n{0), (59) 

which implies 

n{k) <n6{Wik-l)---W{0))n{0). (60) 

Then we introduce pk = X^^^j^g ^m- From Markov's inequality, we have 
.(Wk) , N ,_/ , . J 



^[ji^^^)in"^'nt-n-wm)>^^ 



< -E(d{W{k-l)---W{0) 

ft / 

<^e(a(Flp^j)---A(Fi)), (61) 

where 6q and Fm,m = 1, 2, . . . are defined in the proof of Theorem HI Thus, (j56p and (j6ip lead 
to 

E(A(F|P^l)...A(Fi))=E(E(A(F|P^l)...A(Fi)|/.fc 

Am) 



L^j; -v-i;;--^-^-vM9^J. 



l-(^) 



4n^ 
< ( 1 _ ('^\^<M "o 



('-©"") 



■^771 — Q V fc fc fc fc 



--1 



S^m=o V fc ^-'^fc y ^ 



< ( 1 _ (^)«0^ 280 - (g2) 
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since both the node pair selection process and the node communication process are independent 
for different instances. 

Since there exist a constant p* > and an integer T^ > I such that X]fc=^~ i^k ^-^k) — P* 
for all m > 0, (f6T]) and ([62]) imply 



and thus 

log(l-(f^)") 
The desired conclusion follows. 



TcoM < . . .11,-1 ^°g^'' + Q(l)- (64) 



5.2.3 Proof of Theorem 

With independent communication, it follows from Theorem [J] that Y^^=o {-^k ~^ ^k) = c« if the 
consensus limit ^ exists. It is not hard to see that 

1 - ^-^ {P^ + P,- - 2P+P,-) . (65) 



n 
Since P^, P^ G [0, 1 - e] for aU A; > with < e < 1, we have 

K + Pk - '^Pk^k = Pk + Pk - PkPk - PkPk 

> max{P+, P-] -{l-e) max{P+, P^} 

= emax{P+,P^"} 

>|(^.+ + n~)- (66) 

As a result, we have Xlfc^o {Pk + -^aT ~ "^Pk Pk) ~ °^- -^^ ^ similar argument as we obtain 
Lemma [5l for any /cq > 0, we have 

viwik) G Ml for infinitely many k with A; > fco) = 1 (67) 

conditioned that the consensus limit ^ exists. 

Then we show that for almost all initial conditions, it is impossible to generate finite-time 
convergence along every sample path {W^ {k)}'^' of the random matrix process {Wik)}'^^ which 
satisfies W^^k) G M2 for all k. For any /c = 1, 2, . . . , we define 

X^ = {x° gR" : 3zGE,ri GM2,i = 1,...,A; s.t. Tk- ■ -Tix^ = zl] . 
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Suppose Fi, . . . , Ffc G M2. We denote 

rfc---ri = ($i...$„,)^ (68) 

where ^m is the ?Ti'th column of F^ • • • Fi. With LemmaEl we know that 6{Tk ■ ■ ■ Fi) > 0, which 
imphes 0?=! (span{<I>j}) is a linear space with dimension no larger than n — 2 noticing that 
Ffc • • • Fi is a stochastic matrix. Therefore, since F^ • • • Fix'^ = zl leads to F^ . . . Fi (^x^ — zl) = 0, 
we have 

Ik= U ]R'x3^r,...ri, 

ri,...,rfe6M2 

where 3^rfc-ri = {y G E." : F^ • • • Fiy = 0} = fllLi (span{$i})^. Noting the fact that M2 is a 
finite set, we further conclude that Me(Xfc) = 0, where Me(5) for S C R" denotes the standard 
Lebesgue measure in R". This immediately implies 

Me(|JX,,)=0. (69) 



k=l 

Now we observe that if W''^{k) = I — '^"'■'^"^ ^^' for some u,v G V, n 7^ v and k > ko, 
Y17=i ^i{^) — ^^^ave implies x^ik) = x^{k). Therefore, in this case W^{k) can be replaced by / 
without changing the value of x'^(A; + 1). 

Since the node pair selection process and the node communication process are independent 
with the nodes' states, we conclude from (1671) and (1691) that 



P 



n 

( /.^*(^) ~ "-^^avo k > k() and consensus limit ^ exists 



^ exists 



n 



P( > Xi{k) = nxavei k > ko and consensus achieved in infinite time 



i=l 



^ exists 



+ P( y^Xi{k) = nxave, k > ko and consensus achieved in finite time S, exists] 
< Pixiik) = Xj{k) whenever W{h) = I - ^'^^' ~ ^^' G Mi, fc > /cq and 



consensus achieved in infinite time 



^ exists 



+ ^ Y'[W{t) GM2, T = kQ,...,kQ + m s.t. W {k^ + m) ■ ■ ■ W {kQ)x^ = zl ^ exists) 

m=0 

00 

0+Eo 

m=0 

(70) 
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for all x^ E R" except for IJfcLi^fcj which is a set with measure zero. 
The desired conclusion follows immediately. 

6 Conclusions 

This paper presented new results on the role of unreliable node communication in the convergence 
of randomized gossip algorithms. The model for the random node pair selection process is defined 
by a stochastic matrix which characterizes the interactions among the nodes in the network. A 
pair of nodes meets at a random instance, and two Bernoulli communication links are then 
established between the nodes. Communication on each link succeeds with a time-dependent 
probability. We presented a series of necessary and sufficient conditions on the success probability 
sequence to ensure a.s. consensus or e-consensus under perfectly dependent and independent 
communication processes, respectively. The results showed that the communication symmetry 
is critical for the convergence. 

The results are summarized in the following table. We notice the following characteristics: 





Perfectly Dependent 
Communication 


Independent 
Communication 


Critical Underlying 
Connectivity 


Weak Connectivity 


Double Connectivity 


Consensus Limit = 
Initial Average 


Almost Surely 


In Expectation 


Average Preserved 


Almost Forever 
(for all n) 


Almost Never 
(for odd n) 



Figure 1: Summary of the properties of the random gossip algorithms considered in the paper. 
Perfectly dependent and independent communication gives drastically different behavior. 

• In terms of consensus convergence of the randomized gossip algorithm, Al (Weak Connec- 
tivity) is critical for perfectly dependent communication, as is A4 (Double Connectivity) 
for independent communication. 

• For perfectly dependent communication, the consensus limit equals the initial average 
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with probability one. While for independent communication, only the expected value of 

k — ^k 



the consensus limit equals the initial average for the special case Pu = Pu ■ 



• Average is preserved almost forever (with probability one for all initial conditions) with 
perfectly dependent communication, and it is preserved almost never (with probability 
zero for almost all initial conditions) with independent communication if the number of 
nodes is odd. 

The results illustrate that convergence behavior of distributed algorithms may heavily depend 
on the probabilistic dependence properties in the information flow. 
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